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Recent development in video technology, such as the liquid crystal displays and shutters, 
have made it feasible to incorporate stereoscopic depth into the three-dimensional representations 
on two-dimensional displays. However, depth has already been vividly 

plays without stereopsis using the classical artists' depth cues described by Helmholtz (1866) and 
the dynamic depth cues described in detail by Ittleson (1952). Successful static ep cues inc u 
overlap, size, linear perspective, texture gradients, and shading. Effective dynamic cues include 
looming (Regan and Beverly, 1979) and motion parallax (Rogers and Graham, 1982). 

Stereoscopic depth is superior to the monocular distance cues under certain circumstances. It 
is most useful at portraying depth intervals as small as 5-10 arc seconds. For this reason it is 
extremely useful in user-video interactions such as in telepresence. Objects can be manipulated in 
3-D space, for example, while a person who controls the operations views a virtual image o t e 
manipulated object on a remote 2-D video display. Stereopsis also provides structure and form 
information in camouflaged surfaces such as tree foliage. Motion parallax also reveals form, how- 
ever without other monocular cues such as overlap, motion parallax can yield an ambiguous per- 
ception. For example, a turning sphere, portrayed as solid by parallax, can appear to rotate either 
leftward or rightward. However, only one direction of rotation is perceived when stereo-depth is 
included. If the scene is static, then stereopsis is the principal cue for revealing the camouflaged 
surface structure. Finally, dynamic stereopsis provides information about the direction of motion 
in depth (Regan and Beverly, 1979). When optical flow patterns seen by the two eyes move in 
phase, field motion is perceived in the ffonto-parallel plane. When optical flow is in antip ase ° 
(1 80°) motion is seen in the saggital plane. Binocular phase dispanty of optical flow as sma as 
can be discriminated as changes in visual direction of motion in a 3-D space (Beverly and Regan, 
1975). This would be a useful addition to the visual stimuli in flight simulators. 

Several spatial constraints need to be considered for the optimal stimulation of stereoscopic 
depth. The stimulus for stereopsis is illustrated in figure 1. Each peg subtends a \^ual ang e at 
the entrance pupils of the eyes, and this angle is referred to as binocular parallax. The Mference m 
this angle and the angle of convergence forms an absolute disparity. In the absence of monocular 
depth cues, perceived distance of an isolated target, subtending an absolute dispanty is biase 
toward 1.5 meters from the physical target distance. Gogle and Teitz (1973) referred to this as 
equidistance tendency. If the target moves abruptly from one distance to another, convergence 
responses signal the change of depth (Foley and Richards, 1972); however, smooth continuous 
changes in binocular parallax, tracked by vergence eye movements do not cause changes in per- 
ceived distance (Erkelens and Collewijn, 1985; Guttmann and Spatz, 1985). Once more than one 
disparate feature is presented in the field, differences in depth (stereopsis), stimulated by retinal 
image disparity become readily apparent. Stereothresholds may be as low as 2 sec arc, which 
ranks stereopsis along with vernier and bisection tasks among the hyperacuities. 
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crimn w ?°’ t s . ensitivity toa “g ular de Pth interval varies with the saggital distance of the 

ho™l d r h 1 f ncren ? ent fro u m the fixation plane. Sensitivity to depth increments is highest at the 
1 Q 7 m lxatlon P lan e where the disparity of one of the comparison stimuli is zero (Blakemore 
1970) This optimal condition for stereopsis was used by Tschermack (1930) as one of four crite- 
ria for defining the empirical longitudinal horopter. The Weber fraction describing the ratio of 
increment stereothreshold (arc sec) over the disparity pedestal (arc min) (3 sec/min) is fairly con- 
s ant with disparity pedestal amplitudes up to 1°. This fraction was derived from figure 2 which 
plots stereothreshold in seconds of arc at different saggital distances in minutes arc from the fixa- 
tion point for targets consisting of vertical bars composed of coarse or fine features. A two- 
altemative, forced choice is used to measure a just-noticeable difference between a depth increment 
between an upper test bar and a lower standard bar, both seen at some distance before or behind 

“° n p ane ' U u sed was a narrow ‘ band ’ s P atiall y filtered line produced from a differ- 

ence of Gaussians (DOG) whose center spatial frequency ranges from 9.5 to 0 15 cvcles/dee 

™nr^ d ff SCh0r ’ « Wt the slopes ofthese^unctions 

found with different width DOGs are the same on a logarithmic scale. However, thresholds for 

w spatial frequencies (below 2.5 cpd) are elevated by a constant disparity which illustrates thev 
^ eaf ^ d ^^P^ of thresholds found with higher spatial frequencies. These results illustrate that 
depth stimuli should be presented very near the plane of fixation, which is the video screen. 

nf f! * ere °;™ y remainsbi g h within fixation plane over several degrees about the point 
of fixation. Unlike the rapid reduction of stereo-sensitivity with overall depth or saggital distance 
from the h °ropter, stereo-sensitivity is fairly uniform and at its peak along the central 3° of the fix- 

nf? P H ( ® lakem ° re ’ 1970; . Sch °r and Badcock, 1985). Figures 2 and 3 illustrate a compLson 
of stereo-depth increment sensitivity for this fronto-parallel stereo and the saggital off- horopter 
stereothreshold. Also plotted in figure 3 are the monocular thresholds for defeating vernieroffset 
of the same DOG patterns at the same retinal eccentricities. Clearly, stereopsis remains at its S 

and C 'irrn CmeS h a ° ng f the horopterand there is a Percipitous fall of visual acuity (Wertheim, 1894) 
and, as shown here, of vernier acuity over the same range of retinal eccentricities where stereo 
increment sensitivity is unaffected (Schor and Badcock, 1985). Thus, stereoacuity is not limited 

of & " VOTiCr aC " ity beCaUSe “° thrcsh0lds difto * * 

disDaritv a nSmnM e h threS H° ld k°7° Werd j Pi ” ty Hmit <LDL) for SKre °Pt®. there is an upper 
iT£- * UDL) ’ beyond whlch sterco de P th can no longer be appreciated This upper limit is 
small, being approximately 10 arc min with fine (high-frequency) targets, and some3^ 

ThiTdenfh eSreeS ^ Wltb coarser ( low spatial frequency) fusion stimuli (Schor and Wood 1983) 

1956W Can ^ CXtCnded either by briefl y flashing targets (Westheimer and Tanzman ' 

approximately 24° S The § frnf movements between them (Foley and Richards, 1972) to a UDL of 
PP y ' he UDL presents a common pitfall for many stereo-camera displays that 

a emp to exaggerate stereopsis by placing the stereo-cameras far apart. Paradoxically this can 

prtjducu disparities that exceed the UDL and results in the collapse of depth imo *e fromo p^aHel 

slichtlvTmlto thTIhe) TO W f ™ ‘, hat accompam “ disparides. The diplopia threshold is 
s ghtly smaller than die UDL for static stereopsis, and depth stimulated by large flashed disparities 

is always seen diplopically. Normally, this diplopia can be minimized by shifting con™ C T 

a r? er - H0wever> AiS iS n0t as easily done wbh a stLeo-vidlo monitor In 

real space the stimulus for vergence is correlated with the stimulus for accommodation With video 
isplays, the stimulus for accommodation is fixed at the screen plane while vergence is an 
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independent variable. Because there is cross-coupling between accommodation ^ v er ^nce, we 
are not completely free to dissociate these motor responses (Schor and Kotulak, 1986). With some 
muscular effort, a limited degree of vergence can be expected while accommodation is fixed, 
depending on the accommodative-convergence ratio (AC/A). When this ratio is high, a person 
must choose between clearness and singleness. 


Additional problems for stereoscopic depth occur with abstract scenes containing high spatial 
frequency surface texture. This presents an ambiguous stimulus for stereopsis and fusion which 
can have an enormous number of possible solutions as illustrated by the wallpaper illusion or by a 
random-dot stereogram. The visual system uses various strategies to reduce the number of 
potential fusion combinations and certain spatial considerations of targets presented on the visual 
display can help implement these strategies. A common technique used in computer vision is the 
coarse-to-fine strategy. The visual display is presented with a broad range of spatial frequency 
content. The key idea here is that there is little confusion or ambiguity with coarse features like the 
frame of a pattern. These can be used to guide the alignment of the eyes into registration with finer 
features that present small variations in retinal image disparity. Once in registration, small 
disparities carried by the fine detail can be used to reveal the shape or form of the depth surface. 

An essential condition for this algorithm to work is that sensitivity to large disparities be greatest 
when they are presented with coarse detail and that sensitivity to small disparities be highest with 
fine (high spatial frequency) fusion stimuli. This size-disparity correlation has been verified or 
both the LDL and UDL by Schor and Wood (1983). Figure 4 illustrates the variation of stereo- 
threshold (LDL) and the UDL with spatial frequency for targets presented on a zero disparity 
pedestal at the fixation point. Stereothresholds are lowest and remain relatively constant for spatial 
frequencies above 2.5 cycles/deg. Thresholds increase proportionally with lower spatial ffe- 

quencies. Even though stereothreshold varies markedly with target coarseness, suprathreshold 

disparities needed to match the perceived depth of a standard disparity are less dependent on spatial 
frequency. This depth equivalence constitutes a form of stereo-depth constancy (Schor and 
Howarth, 1986). Similar variations in the diplopia threshold or binocular fusion limit are found by 
varying the coarseness of fusion stimuli (Schor, Wood, and Ogawa, 1984b). 


Figure 5 illustrates that the classical vertical and horizontal dimensions of Panum's fusion 
limit (closed and open symbols, respectively) are found with high spatial frequency targets, but the 
fusion limit increases proportionally with the spatial width of targets at spatial frequencies lower 
than 2 5 cycles/deg. When measured with high-frequency DOGs, the horizontal radius of PFA 
(Panum’s fusional area) is 15 min; and when measured with low-frequency stimuli, PFA equals a 
90° phase disparity of the fusion stimulus. 

The increase in Panum's fusion limit appears to be caused by monocular limitations to spatial 
resolution. For example, if the same two targets that were used to measure the diplopia threshold 
are both presented to one eye to measure a two-point separation threshold, such as the Rayleigh 
criterion, then the monocular and binocular thresholds are equal when tested with spatial frequen- 
cies lower than 2.5 cpd. At higher spatial frequencies we are better able to detect smaller separa- 
tions between two points presented monocularly than dichoptically. This difference at high spatia 
frequencies reveals a unique binocular process for fusion that is independent of spatial resolution. 
With complex targets composed of multiple spatial frequencies, at moderate disparities such as 
20 min arc, a diplopia threshold may be reached with high spatial frequency components while 
stereopsis and fusion may continue with the low spatial frequency components. An example of 
this simultaneous perception can be seen with the diplopic pixils in a random dot stereogram whose 
coarse camouflaged form is seen in vivid stereoscopic depth (Duwaer, 1983). 
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In addition to target coarseness, there are several other aspects of spatial configuration that 

WheamL^ns?^ 5 ^ US10n ' P e ? adl . tlonal studies of stereopsis, such as those conducted by 
* 38, mainly consider the dispanty stimulus in isolation from other disparities at the 
same or different regions of the visual field. It is said that disparity is processed locally in this 
limiting case, independent of other possible stimulus interactions other than the comparison 
between txvoabsoiute disparities to form a relative disparity. However, recent investigations have 
. ly illustrated that “ addltlon t0 the local processes, there are global processes in which spatial 
interaction between multiple relative disparities in the visual field can influence both stereopsis and 
usion. Three forms of global interactions have been studied. These are disparity crowding dis- 
parity gradients, and dispanty continuity or interpolation. These global interactions appear to 
influence phenomena such as the variation in size of Panum's fusional area, reductions and 
enhancement of stereo-sensitivity, constant errors or distortions in depth perception, and resolution 
or a 3-D form that has been camouflaged with an ambiguous surface texture. 

Spatial crowding of visual targets to less than 10 arc min results in a depth averaging of 

nra?! ea T!i ThlS 1S man l fest as an elevatlon of stereothreshold as well as a depression of the 
UDL (Schor, Bndgeman, and Tyler, 1983). The second global interaction, disparity gradient, 

T ^ d f a P° n ^P ac ‘ ng between disparate targets and the difference in their disparities. (Schor ’and 
S i! I 6 S panty gradient ^presents how abruptly disparity varies across the visual 
d The effect of dispanty gradients upon the sensory fusion range has been investigated with 

ani n ieS S H by ^ T and with P eriodic sinuosidal spatial variations in horizontal 
th d dispanty by Schor and Tyler (1981). Both groups demonstrate that the diplopia 

threshold increases according to a constant disparity gradient as the separation between adjacent 
fusion stimuli increases. Cyclofusion limits are also reduced by abrupt changes in disparity 
^: ee " neighb0nn ^ ret ! naI regions (Kertesz and Optican, 1974). Stereothresholds can also be 

Panty Sradie ^‘ targCt separation Creases, so does stereothreshold, 
, P f 15 arc mm separation. Further reduction in separation results in crowding which 

elevates the stereothreshold. The UDL is also limited by a constant disparity gradient (fif 5) As 
spacing decreases, there is a proportional decrease in the UDL. These gradient effects set two 
strict limitations on the range of stereopscopic depth that can be rendered by the video display. As 

armT ^ the UDL will decrease. The effect is that targets exceeding the UDL will 

appear diplopic and without depth. For example, a top-down picture of a forest which has trees of 
uneven height will not be seen as uneven depth if the trees are imaged too closely. To remedy this 
problem, the depth should be reduced by moving the stereocameras closer together. In the other 

3 Sh / n 7 S ° PC W1 n0t Seen in depth unless h ex ceeds the gradient for stereothresh- 
mait fh Ven lf U l° eS ’ U m f y Stdl n0t be seen lf h exten ds across the entire visual display Nor- 
mal y there can be unequal optical errors of the two eyes which produce unequal magnification of 

the two retinal images. This amso magnification produces an apparent tilt ofthe stereoscopic frame 
reference referred to as the fromo-parallel plane. However, this constant depth S* 
corrected or compensated for perceptually (Morrison, 1977). This perceptual compensation could 
reduce sensitivity to wide static displays of a shallow depth gradient. 

A third form of global interaction is observed under conditions where disparity differences 

f>nnfnh neigh r° nng ° CCUr t0 ° gradually to be detected, such as in the 3-D version of the 

Craik-Obnen Cornsweet illusion (fig. 6 by Anstis, Howard, and Rogers, 1978), when stereo pat- 

and^KerS^ T b ]h efly t0 be processed ful3 y (Ramachandran and Nelson, 1976; Mitchison 
and McKee, 1985), or when several equally probable, but ambiguous, disparity solutions are pre- 
sented m a region neighboring an unambiguous disparity solution (Kontsevich 1986) Underall 
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of these conditions, the depth percept resulting from the vague disparity is similar to or c ° n ^ nu ° u 
with the depth stimulated by the more visible portion of the disparity stimulus This illustrates the 
principle of depth continuity formulated by Julesz (1971) and restated later by Marr and Poggio 
(1979), which recently was shown by Ramachandran and Cavanaugh (1985) to include the exten- 
sion of depth to subjective contours in which no physical contour or disparity exists. 


Clearly there are many spatial constraints, including spatial frequency content, retinal eccen- 
tricity, exposure duration, target spacing, and disparity gradient, which— when properly 
adjusted— can greatly enhance stereodepth in video displays. 
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Figure 1. Retinal image disparity based on horizontal separation of the two eyes. 
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ftg^TtaeshoId *** cerements obtained, for observer D.B., as a function of pedestal size in 

hn^a and dlvergent directions. Functions illustrate results obtained with a thin 
bar and DOGs whose center spatial frequencies ranged from 0.15 to 9.6 c/deg. Panels C and D 
plot the performance measured when the comparison stimulus was a thin bright bar and the test 
stimulus was a DOG. Panels A and B show the results obtained when a DOG was used both 
as a comparison and as a test stimulus. Panels A and C plot stereothreshold on a log scale 
The data are replotted on a linear scale in panels B and D. 


2-10 









Retinal Eccentricity X 2 


Figure 3. A comparison is made of extra-foveal vernier threshold (solid line) with extra-foveal 
(mixed dashed line) and extra-horopteral (long dashed line) stereothresholds for a high spatial 
frequency stimulus (upper plot) and a low spatial frequency stimulus (lower plot). Note that 
retinal eccentricity has been doubled to be comparable to disparity pedestal. Over a 40 arc min 
range of retinal eccentricity, stereoacuity remained unchanged and vernier acuity increased 
moderately. A marked increase in stereothreshold occurred over a comparable (80 arc min) 
disparity pedestal range. 
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DISPARITY (arc min) 




CENTER SPATIAL PERIOD ( DEGREES) 

CENTER SPATIAL FREQUENCY (CYCLES /DEGREE) 

Figure 4. Upper and lower limits for stereopsis are plotted for two subjects as a function of DOG 
center spatial period along dashed curves at the top and bottom of data sets for uncrossed and 
crossed disparities respectively. Stereothreshold was lowest at small spatial periods 
(<0.42 arc min) and increased according to a 6° phase disparity between stereo-half images as 
spatial period increased. The upper limit increased proportionally to the square root of spatial 
period over the same range of broad spatial periods. Depth matching curves (solid lines) for 
several standard suprathreshold disparities (horizontal arrows) have flatter frequency responses 
than the upper and lower dashed threshold curves. Their breakaway point occurs at a higher 
spatial period for crossed than for uncrossed disparities. The luminance profile of the 
difference of two Gaussian functions is inset in the upper left comer. 
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Figure 5. Diplopia thresholds for two subjects are plotted as a function of bright bar width (B) of 
bar and difference of two Gaussian functions (DOG). Luminance profiles of these two test 
stimuli are inset below and above the data respectively. A constant phase disparity of 90 is 
shown by the dashed diagonal line. Horizontal and vertical Panum's fusion ranges (solid lines) 
coincide with the 90° phase disparity for DOG widths greater than 21 arc min. At the broadest 
DOG width the upper fusion limit equals the upper disparity limit for stereoscopic depth per- 
ception (bold dashed line). The standard deviation of the mean is shown for the broadest DOG 
stimulus. At narrow DOG widths, both horizontal and vertical fusion limits approach a con- 
stant minimum threshold. Panum's fusion ranges remain fairly constant when measured with 
bar patterns (dotted lines) and resemble values obtained with high spatial frequency DOGs. 


2-13 





Figure 6. Perspective sketch of the illusory depth surface. Left part looks apparently nearer than 
the right part. 
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